DETAILED ACTION
Claim Rejections - 35 USC § 103 

1. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

2. Claims 1, 6, and 10-12 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Shriberg et al. (incorporated by reference) in view of Elko et al. (US 
4741038). 

3. Regarding claims 1 and 11-12, Shriberg et al. disclose a method, a computer
usable medium having computer readable program code, and a digital audio processing
system for the segmentation of an audio stream into semantic or syntactic units wherein
the audio stream is provided in a digitized format, comprising the steps of: determining a
fundamental frequency for the digitized audio stream (Section 2.1.2.3 on page 133);
detecting changes of the fundamental frequency in the audio stream (pages 134-135,
refer to figure 4); determining candidate boundaries for the semantic or syntactic units
depending on the detected changes of the fundamental frequency (pages 134-135);
extracting and combining a plurality of prosodic features in the neighborhood of the
candidate boundaries (section 2.1.1 on page 130 and section 2.1.4 on page 137); and
determining boundaries for the semantic or syntactic units depending on the at least
one prosodic feature (pages 134-135, F0 is a prosodic feature).

Shriberg et al. fail to specifically disclose the step of detecting the changes of the 
fundamental frequency includes providing a threshold value for estimates of the 
fundamental frequency's voicedness and determining whether the voicedness of the 
fundamental frequency estimates are higher or lower than the threshold value, and 
wherein the voicedness of the fundamental frequency estimates lower than the 
threshold value equals no voice, and wherein the voicedness of the fundamental 
frequency estimates higher than the threshold value equals voice. However, Elko et al. 
teach the step of detecting the changes of the fundamental frequency includes providing 
a threshold value for estimates of the fundamental frequency's voicedness and 
determining whether the voicedness of the fundamental frequency estimates are higher 
or lower than the threshold value, and wherein the voicedness of the fundamental 
frequency estimates lower than the threshold value equals no voice, and wherein the 
voicedness of the fundamental frequency estimates higher than the threshold value 
equals voice (col. 11, lines 29-39).

Since Shriberg et al. and Elko et al. are analogous art because they are from the
same field of endeavor, it would have been obvious to one of ordinary skill in the art at
the time of invention to modify Shriberg et al. by incorporating the teaching of Elko et al.
in order to enable the system to place more coding emphasis on the voiced portion than
on the unvoiced portion to reduce processing time and increase transmission rate.

4. Regarding claims 6 and 10, Shriberg et al. further disclose the method of claim 1,
wherein at least one prosodic feature is represented by the fundamental frequency
(Section 2.1.1, page 130) and the step of performing a prosodic feature classification
based on a predetermined classification tree (section 2.1.2 on page 131, grouping
features).

5. Claims 3-5, 8, and 13 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Shriberg et al. (incorporated by reference) in view of Elko et al. (US 
4741038), as applied to claim 1, and further in view of Yeldener et al. (US 5774837). 

6. Regarding claims 3 and 13, Shriberg et al. fail to specifically disclose a method
for defining an index function for the fundamental frequency having a value =0 if the 
voicedness of the fundamental frequency is lower than the threshold value and having a 
value =1 if the voicedness of the fundamental frequency is higher than the threshold 
value. However, Yeldener et al. teach a method for defining an index function for the 
fundamental frequency having a value =0 if the voicedness of the fundamental 
frequency is lower than the threshold value and having a value =1 if the voicedness of 
the fundamental frequency is higher than the threshold value (col. 14, ln. 4-55, the goal
is to use 0 and 1 to represent the unvoiced and voiced portions, respectively).
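
The index function recited in claims 3 and 13 can be illustrated with a minimal
sketch. It is not taken from Shriberg et al., Elko et al., or Yeldener et al.; the
function name, the per-frame voicedness scores, and the threshold of 0.5 are all
hypothetical. The claim language does not say how a score exactly equal to the
threshold is treated, so the sketch assumes it maps to 0.

```python
from typing import List


def voicing_index(voicedness: List[float], threshold: float) -> List[int]:
    """Map each frame's voicedness score to 1 (voiced) or 0 (unvoiced).

    Assumption: a score exactly equal to the threshold is treated as 0,
    since the claim language only covers "higher" and "lower".
    """
    return [1 if v > threshold else 0 for v in voicedness]


# Hypothetical per-frame voicedness scores; the values are made up.
frames = [0.91, 0.85, 0.12, 0.05, 0.77]
print(voicing_index(frames, threshold=0.5))  # prints [1, 1, 0, 0, 1]
```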

Since Shriberg et al. and Yeldener et al. are analogous art because they are from
the same field of endeavor, it would have been obvious to one of ordinary skill in the
art at the time of invention to modify Shriberg et al. by incorporating the teaching of
Yeldener et al. in order to enable the system to place more coding emphasis on the voiced
portion than on the unvoiced portion to reduce processing time and increase transmission rate.

7. Regarding claims 4-5, Shriberg et al. further disclose a method for extracting at
least one prosodic feature in an environment of the audio stream where the value of the
index function is equal to 0 (section 2.1.1 on page 130 discusses feature extraction of both
voiced and unvoiced portions), and that the environment is a time period between 500 and
4000 milliseconds (Section 2.1.1 on page 130).
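
One possible reading of the limitation in claims 4-5 is sketched below: for every
frame whose index value is 0, a feature is computed over a surrounding window
whose length lies between 500 and 4000 milliseconds. This is an illustration
only and is not the method of Shriberg et al.; the function names, the frame
duration, and the choice of mean F0 over voiced frames as the feature are
assumptions.

```python
from typing import Dict, List, Optional, Tuple


def environment_bounds(center: int, n_frames: int,
                       window_ms: int = 1000, frame_ms: int = 10) -> Tuple[int, int]:
    """Return [lo, hi) frame bounds of a window centered on `center`."""
    if not 500 <= window_ms <= 4000:
        raise ValueError("window_ms must lie between 500 and 4000")
    half = (window_ms // frame_ms) // 2
    return max(0, center - half), min(n_frames, center + half + 1)


def mean_f0_near_unvoiced(f0: List[float], index: List[int],
                          window_ms: int = 1000,
                          frame_ms: int = 10) -> Dict[int, Optional[float]]:
    """For each frame with index 0, average F0 over voiced frames nearby."""
    features: Dict[int, Optional[float]] = {}
    for i, flag in enumerate(index):
        if flag != 0:
            continue  # only environments around unvoiced frames are considered
        lo, hi = environment_bounds(i, len(index), window_ms, frame_ms)
        voiced = [f0[j] for j in range(lo, hi) if index[j] == 1]
        features[i] = sum(voiced) / len(voiced) if voiced else None
    return features


# Hypothetical data: five 250 ms frames, F0 in Hz, with a 500 ms environment.
f0 = [100.0, 120.0, 0.0, 0.0, 140.0]
index = [1, 1, 0, 0, 1]
print(mean_f0_near_unvoiced(f0, index, window_ms=500, frame_ms=250))
# prints {2: 120.0, 3: 140.0}
```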

8. Regarding claim 8, Shriberg et al. do not disclose a method that first detects
speech and non-speech segments in the digitized audio stream and performs the
steps of claim 1 thereafter only for detected speech segments. However, Yeldener et al.
teach a method that first detects speech and non-speech segments in the digitized audio
stream and performs the steps of claim 1 thereafter only for detected speech
segments (col. 14, ln. 5-55).

Since Shriberg et al. and Yeldener et al. are analogous art because they are from
the same field of endeavor, it would have been obvious to one of ordinary skill in the
art at the time of invention to modify Shriberg et al. by incorporating the teaching of
Yeldener et al. in order to enable the system to place more coding emphasis on the voiced
portion than on the unvoiced portion to reduce processing time and increase transmission rate.

9. Claims 9 and 14 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Shriberg et al. (incorporated by reference) in view of Elko et al. (US 4741038), in view of
Yeldener et al. (US 5774837), as applied to claims 8 and 13 above, and further in view 
of Eryilmaz (US 5867574). 

10. Regarding claims 9 and 14, the modified Shriberg et al., as applied to claims 8
and 13 above, fail to disclose a method wherein the detecting of speech and non-speech
segments comprises utilizing the signal energy or signal energy changes, respectively,
in the audio stream. However, Eryilmaz teaches a method wherein the detecting of speech and
non-speech segments comprises utilizing the signal energy or signal energy changes,
respectively, in the audio stream (col. 3, ln. 40 to col. 4, ln. 54).
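
Energy-based detection of speech and non-speech segments, as recited in claims 9
and 14, can be sketched generically as follows. This is not the specific method
of Eryilmaz; the function names, frame length, and energy threshold are
hypothetical. The sketch uses only the per-frame signal energy and does not
model energy changes.

```python
from typing import List


def frame_energies(samples: List[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each complete, non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def speech_flags(samples: List[float], frame_len: int,
                 energy_threshold: float) -> List[bool]:
    """Flag a frame as speech when its energy exceeds the threshold."""
    return [e > energy_threshold for e in frame_energies(samples, frame_len)]


# Hypothetical signal: silence, a loud burst, then low-level noise.
signal = [0.0, 0.0, 0.0, 0.0,
          0.5, -0.5, 0.5, -0.5,
          0.01, -0.01, 0.01, -0.01]
print(speech_flags(signal, frame_len=4, energy_threshold=0.01))
# prints [False, True, False]
```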

Since the modified Shriberg et al. and Eryilmaz are analogous art because they
are from the same field of endeavor, it would have been obvious to one of ordinary skill
in the art at the time of invention to further modify Shriberg et al. by incorporating the
teaching of Eryilmaz in order to enhance the detection of the voice portion of the signal
when background noise is present.

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Huyen X. Vo whose telephone number is 571-272-7631.
The examiner can normally be reached on M-F, 9-5:30. 
If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, Wayne Young, can be reached on 571-272-7582. The fax phone number for
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 